Tips for setting up a codebase so that AI coding tools can work on it more productively, including automated tests, interactive testing, issue tracking, documentation, and linters/formatters.
Simon Willison received a preview unit of the NVIDIA DGX Spark, a desktop "AI supercomputer" retailing around $4,000. He details his experience setting it up and navigating the ecosystem, highlighting both the hardware's impressive specs (ARM64, 128GB RAM, Blackwell GPU) and the initial software challenges.
Key takeaways:
* **Hardware:** The DGX Spark is a compact, powerful machine aimed at AI researchers.
* **Software Hurdles:** Initial setup was complicated by the need for ARM64-compatible software and CUDA configurations, though NVIDIA has significantly improved documentation recently.
* **Tools & Ecosystem:** Claude Code was invaluable for troubleshooting. Ollama, `llama.cpp`, LM Studio, and vLLM are already gaining support for the Spark, indicating a growing ecosystem.
* **Networking:** Tailscale simplifies remote access.
* **Early Verdict:** It's too early to definitively recommend the device, but recent ecosystem improvements are promising.
Simon Willison discusses Toad, a new terminal coding assistant built by Will McGugan using Textual. It aims to improve upon existing tools like Claude Code and Gemini CLI by avoiding flicker and offering better interaction with terminal output. Toad is currently in private preview, available through GitHub sponsorship.
The article details the author's use of Claude Code to add a feature to a GitHub repository: an automatically updated README index. It's accompanied by a 7-minute video demonstrating the process.
An article detailing Phoenix.new, Fly.io's AI-assisted app development platform built on Phoenix and Elixir. It explores the platform's capabilities, the author's experience building a notebook application with it, and its potential for expansion beyond Elixir.
This article discusses a new paper outlining design patterns for mitigating prompt injection attacks in LLM agents. It details six patterns – Action-Selector, Plan-Then-Execute, LLM Map-Reduce, Dual LLM, Code-Then-Execute, and Context-Minimization – and emphasizes that each trades agent utility for security: the patterns work by deliberately limiting an agent's ability to perform arbitrary tasks.
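As an illustration of one of these, here is a minimal sketch of the Dual LLM pattern written against the LLM Python library; the function names, prompts, and model ID are invented for illustration and are not taken from the paper:

```python
# Illustrative sketch of the Dual LLM pattern (not code from the paper).
import llm

model = llm.get_model("gpt-4.1-mini")  # any model supported by LLM

def quarantined_summarize(untrusted_document: str) -> str:
    # Quarantined LLM: processes untrusted content; its output is stored as
    # an opaque variable and never pasted into a privileged prompt.
    return model.prompt(f"Summarize this text:\n\n{untrusted_document}").text()

def privileged_plan(user_request: str, variable_names: list[str]) -> str:
    # Privileged LLM: sees only the trusted user request plus symbolic names
    # ($VAR1, ...) referring to quarantined outputs, never their contents.
    refs = ", ".join(variable_names)
    return model.prompt(
        f"User request: {user_request}\n"
        f"You may refer to these stored values by name only: {refs}"
    ).text()

summary = quarantined_summarize(open("untrusted_email.txt").read())
plan = privileged_plan("Draft a reply that includes a short summary", ["$VAR1"])
# A plain (non-LLM) orchestrator substitutes $VAR1 -> summary only when
# executing a concrete, pre-approved action such as send_email(body=...).
```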
LLM 0.26 introduces tool support, allowing LLMs to call Python functions as tools. The article details how to install, configure, and use these tools with models from OpenAI, Anthropic, Gemini, and Ollama, including examples with plugins and ad-hoc functions. It also discusses the implications for building 'agents' and future development plans.
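A minimal example along the lines of those in the post, using the Python API with an ad-hoc function as a tool (assumes an API key is configured; the model ID is just one option):

```python
import llm

def multiply(x: int, y: int) -> int:
    """Multiply two integers."""
    return x * y

model = llm.get_model("gpt-4.1-mini")
# chain() lets the model call the tool, then continue with the tool's result
response = model.chain("What is 34234 * 213345?", tools=[multiply])
print(response.text())
```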
A summary of a workshop presented at PyCon US on building software with LLMs, covering setup, prompting, building tools (text-to-SQL, structured data extraction, semantic search/RAG), tool usage, and security considerations like prompt injection. It also discusses the current LLM landscape, including models from OpenAI, Gemini, Anthropic, and open-weight alternatives.
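As a flavour of the structured data extraction section, here is a hedged sketch using LLM's schema support; the schema, prompt, and model ID are invented for illustration and are not from the workshop materials:

```python
import llm
from pydantic import BaseModel

class Speaker(BaseModel):
    name: str
    talk_title: str

model = llm.get_model("gpt-4.1-mini")
response = model.prompt(
    "Extract the speaker and talk title from: "
    "'Jane Doe presents Building Agents with LLMs at 3pm'",
    schema=Speaker,
)
print(response.text())  # JSON conforming to the Speaker schema
```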
This article details a new plugin, llm-video-frames, that allows users to feed video files into long context vision LLMs (like GPT-4.1) by converting them into a sequence of JPEG frames. It showcases how to install and use the plugin, provides examples with the Cleo video, and discusses the cost and technical details of the process. It also covers the development of the plugin using an LLM and highlights other features in LLM 0.25.
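Conceptually, the plugin automates something like the following (a rough sketch, not the plugin's actual code; assumes ffmpeg is on PATH, and the frame rate, file name, and model ID are illustrative):

```python
import subprocess
import tempfile
from pathlib import Path

import llm

def frames_from_video(path: str, fps: int = 1) -> list[Path]:
    """Extract JPEG frames at the given frames-per-second rate into a temp directory."""
    outdir = Path(tempfile.mkdtemp())
    subprocess.run(
        ["ffmpeg", "-i", path, "-vf", f"fps={fps}", str(outdir / "frame_%04d.jpg")],
        check=True,
    )
    return sorted(outdir.glob("frame_*.jpg"))

model = llm.get_model("gpt-4.1")
attachments = [llm.Attachment(path=str(p)) for p in frames_from_video("video.mp4")]
response = model.prompt("Describe what happens across these frames", attachments=attachments)
print(response.text())
```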
An analysis of the recent paper 'The Leaderboard Illusion' which critiques the Chatbot Arena's LLM evaluation methodology, focusing on issues with private testing, unfair sampling, and potential gaming of the leaderboard. It also explores OpenRouter as a potential alternative ranking system.